The Automatic Explanation of Multivariate Time Series

نویسنده

  • Allan Tucker
چکیده

Due to the advances in data capture and storage techniques over the last decade, the size of Multivariate Time Series (MTS) data being recorded has grown massively. Many of these MTS are characterised by a large number of interdependent variables with large possible time lags. If new and useful knowledge is to be automatically learnt from this type of data in order to aid the understanding of the underlying processes, a paradigm must be identified that is capable of modelling data with these characteristics but at the same time exhibiting transparency in how it models the data. A key challenge is that the number of possible models is very large since it does not only depend on the number of time series variables, but also on the size of possible time lags between ‘causes’ and ‘effects’. In this thesis a general framework is described for automatically learning probabilistic models from MTS with large time lags and high dimensionality in order to explain the underlying processes involved. Specifically, a novel method to learn dynamic Bayesian networks for explanation from these series is developed. This involves an efficient pre-processing stage, which effectively groups MTS variables in order to reduce the dimensionality of the problem. After pre-processing, a combination of Evolutionary Programming, Genetic Algorithms and heuristics is used to speed up convergence when learning models. In addition, an approach is looked at for the off-line learning of dynamic Bayesian networks with changing dependency structures. All experiments have been carried out on a mixture of synthetic and real data taken from an oil refinery repository. The resultant models are used to generate explanations that are evaluated in several ways, including reviewing the feedback from chemical process engineers. These results have demonstrated that the proposed framework is very promising in terms of both efficiency and accuracy. Allan Tucker The Automatic Explanation of Multivariate Time Series Birkbeck College

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Univariate, Multivariate and Combined Time Series Model to Prediction and Estimation the Mean Annual Sediment (Case Study: Sistan River)

Erosion, sediment transport and sediment estimate phenomenon with their damage in rivers is a one of the most importance point in river engineering. Correctly modeling and prediction of this parameter with involving the river flow discharge can be most useful in life of hydraulic structures and drainage networks. In fact, using the multivariate models and involving the effective other parameter...

متن کامل

Missing data imputation in multivariable time series data

Multivariate time series data are found in a variety of fields such as bioinformatics, biology, genetics, astronomy, geography and finance. Many time series datasets contain missing data. Multivariate time series missing data imputation is a challenging topic and needs to be carefully considered before learning or predicting time series. Frequent researches have been done on the use of diffe...

متن کامل

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

Identification of outliers types in multivariate time series using genetic algorithm

Multivariate time series data, often, modeled using vector autoregressive moving average (VARMA) model. But presence of outliers can violates the stationary assumption and may lead to wrong modeling, biased estimation of parameters and inaccurate prediction. Thus, detection of these points and how to deal properly with them, especially in relation to modeling and parameter estimation of VARMA m...

متن کامل

Time series forecasting of Bitcoin price based on ARIMA and machine learning approaches

Bitcoin as the current leader in cryptocurrencies is a new asset class receiving significant attention in the financial and investment community and presents an interesting time series prediction problem. In this paper, some forecasting models based on classical like ARIMA and machine learning approaches including Kriging, Artificial Neural Network (ANN), Bayesian method, Support Vector Machine...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006